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MEASUREMENT OF OVERLAY USING DIFFRACTION GRATINGS 
WHEN OVERLAY EXCEEDS THE GRATING PERIOD 

INVENTORS: Rodney Smedt, Abdurrahman Sezginer, and Hsu-Ting Huang 

5 PRIORITY CLAIM 

The present appUcation claims priority to U.S. Provisional Patent Application Serial 
No. 60/426,156, filed November 14, 2002, which is incorporated in this document by 
reference. 

TECHNICAL FIELD 

10 This invention relates to measuring the pattem overlay alignment accuracy of a pair 

of patterned layers on a semiconductor wafer, possibly separated by one or more layers, 
made by two or more lithography steps during the manufacture of semiconductor devices. 

BACKGROUND OF THE INVENTION 

Manufacturing semiconductor devices involves depositing and patterning several 

15 overlaying layers. A typical semiconductor wafer might include, for example, a series of 
gates formed on a first layer and a series of interconnects formed on a second layer. The two 
layers (and their structures) are formed at different lithography steps in the manufacturing 
process. Alignment between the two layers is critical to ensure proper connection between 
the gates and their interconnects. Typically, this means that the tolerance of alignment must 

20 be less than the width of a single gate. 

Overlay is defined as the displacement of a pattemed layer from its ideal position 
aligned to a layer pattemed earlier on the same wafer. Overlay is a two dimensional vector 
(Ax, Ay) in the plane of the wafer. Overlay is a vector field, i.e., the value of the vector 
depends on the position on the wafer. Perfect overlay and zero-overlay are used 

25 synonymously. Overlay and overlay error are used synonymously. Depending on the 
context, overlay may signify a vector or one of the components of the vector. 

Overlay metrology provides the information that is necessary to correct the alignment 
of the stepper-scanner and thereby minimize overlay error with respect to previously 
pattemed layers. Overlay errors, detected on a wafer after exposing and developing the 

30 photoresist, can be corrected by removing the photoresist, repeating exposure on a corrected 
stepper-scanner, and repeating the development of the photoresist. If the measured error is 
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acceptable but measurable, parameters of the lithography process could be adjusted based on 
the overlay metrology to avoid excursions for subsequent wafers. 

Most prior overlay metrology methods use built-in test pattems etched or otherwise 
formed into or on the various layers during the same plurality of lithography steps that form 
5 the pattems for circuit elements on the wafer. One typical pattern, called "box-in-box" 

consists of two concentric squares, formed on a lower and an upper layer, respectively. "Bar- 
in-bar" is a similar pattem with just the edges of the "boxes" demarcated, and broken into 
disjoint line segments. The outer bars are associated with one layer and the inner bars with 
another. Typically one is the upper pattem and the other is the lower pattem, e.g., outer bars 

10 on a lower layer, and inner bars on the top. However, with advanced processes the 

topographies are complex and not truly planar so the designations "upper" and "lower" are 
ambiguous. Typically they correspond to earlier and later in the process. The squares or bars 
are formed by lithographic and other processes used to make planar structures, e.g., 
chemical-mechanical planarization (CMP). 

1 5 In one form of the prior art, a high performance microscope imaging system 

combined with image processing software estimates overlay error for the two layers. The 
image processing software uses the intensity of light at a multitude of pixels. Obtaining the 
overlay error accurately requires a high quality imaging system and means of focusing the 
system. One requirement for the optical system is very stable positioning of the optical 

20 system with respect to the sample. Relative vibrations blur the image and degrade the 

performance. Reducing vibration is a difficult requirement to meet for overlay metrology 
systems that are integrated into a process tool, like a lithography track. 

As disclosed in U.S. Patent Application Serial No. 2002/0158193; U.S. Patent 
Application 2003/0190793 Al; and as described in Proc. of SPIE Vol. 5038 February 2003 

25 "Scatterometry-Based Overlay Metrology" by Huang et al., p. 126-137 and "A novel 

diffraction based spectroscopic method for overlay metrology" by Yang et al. p. 200-207 (all 
four incorporated in this document by reference) one approach to overcome these difficulties 
is to use overlay metrology targets that are made of a stack of two diffraction gratings as 
shown in Figure 1 . The grating stack 10 has one grating 20 in the lower layer and another 

30 grating 30 in the upper layer as shown in Figure 1 . The layers of 20 and 30 are to be aligned. 
There are two instances of the grating stack 10, one for the x-component of overlay and one 
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for the y-component. The measurement instrument is such that it does not resolve individual 
grating lines. It measures overall optical properties of the entire grating. Optical properties 
are measured as a function of wavelength, polar or azimuthal angle of incidence, polarization 
states of the illumination and the detected light, or any combination of these independent 
5 variables. An alternative embodiment uses two stacks of line gratings to measure x-overlay 
and two stacks of line gratings to measure y-overlay (four grating stacks total). Still another 
embodiment uses three line grating stacks in combination to simultaneously measure both x 
and y alignment. (See also PCT publication WO 02/25723 A2, incorporated herein by 
reference). Scatterometry (diffraction) is proving to be an effective tool for measuring 
10 overlay. 

A shortcoming of the prior scatterometry-based art is that, diffraction gratings cannot 
distinguish overlay values that differ by an integer number of periods. Let R(X.,9, 4) denote 
the specular (0-th order) reflection of the grating at wavelength X, angle of incidence G, and 
offset 4. The offset ^ is the distance between centerlines of lower and upper grating lines as 
15 shown in Figure 1 . The function R(A.,0, ^) is periodic with respect to the offset ^: 

R(X,e,^) = R(X,0,^+P) (1) 
where P is the period of the grating. The scatterometry-based overlay measurements 
are even more ambiguous when the profiles of the grating lines are symmetric. Then, a 
consequence of reciprocity is: 
20 R(?t,G,^) = R(M,-^) (2) 

R(X,e, (P/2)+^) = R(X,e, (P/2).^) (3) 
There are two values of offset, separated by P/2, where the optical properties become 
ambiguous. Optical properties are insensitive to overlay at these points. Therefore, the 
largest measurement range is P/2 when the grating lines have symmetric cross-sections. If 
25 the unit cell of the grating is substantially asymmetric, the measurement range becomes P, a 
whole period. Huang et al., cited above, describe means of manufacturing a grating with an 
asymmetric unit cell. The measurement range also becomes a whole period if two grating 
stacks are used, the offset of each stack is biased, and the two offset biases differ by P/4 (as 
described in U.S. Application Serial No. 10/613,378, filed July 3, 2003, incorporated in this 
30 document by reference). 
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Figure 2 shows the calculated near-normal incidence, unpolarized reflectance of a 
grating stack as a function of offset ^. Each curve corresponds to a different wavelength. 
The ambiguity described by Equations (2) and (3) is clearly visible in Figure 2. The ordinate 
of the graph in Figure 3 shows overlay measured using diffraction gratings with a l)am 
5 period and 1 : 1 line to space ratio. Overlay was obtained by fitting the reflection spectra of a 
pair of gratings with rigorous coupled wave analysis. The abscissa shows the best estimate 
of the actual value of overlay, which is the sum of the overlay that was intentionally written 
to the reticle that produced the gratings and the overlay that was imparted by the lithography 
stepper-scanner. Overlay measured using gratings is not valid when it approaches or exceeds 
10 ±P/4 (250nm in this example). Figure 4 is an abstraction that shows the periodic nature of 
the overlay measured using diffraction gratings. The overlay measured by the grating, 
^QiATiNG ' related to the actual overlay as follows: 



A^ = int 



^ACTUAL 

PI2 



(4) 



^GRATING ~ (~0 [^ACTUAL ~ ^ ^\ 

In Equation (4), int[x] denotes the integer nearest to x. In prior art, the integer N is 
15 unknown. Therefore, Ax^j^ting represents Ax^ctual when N is zero, that is, when 
|^actual|<^/4. Ax 

GRATING ' which is also called fine-overlay measurement, has high 
precision but it is only accurate when N=0. Gross overlay is defined as the condition 
I Ax^cTUAL I ^ ^ / 4 or I Ay^CTUAL | ^ ^ / 4 , that is, any one component of overlay exceeding a 
quarter of the period. Gratings cannot detect gross overlay until overlay gets so large that the 
20 upper and lower gratings do not overlap in part of the measurement spot. Although gross 
overlay is rare in well-tuned lithography processes, alignment errors larger than 100 nm, 
even as large as several microns occur when a new process, a new reticle, or a new projector 
is introduced. In these instances, there is a need to not only detect but also to measure gross 
overlay. 

25 Although increasing the period increases the measurement range, P/2, this approach is 

not preferred because it reduces the sensitivity of the optical response of grating stacks to 
overlay. As the period is increased, if the period becomes a significant fraction of the 
diameter of the measurement spot, the placement of the spot on the grating affects the 
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overlay measurement and reduces its precision. For these two reasons, increasing the period 
to increase the measurement range is counter-productive. Using more than one grating stack, 
each with a different period, reduces but does not eliminate ambiguity. 

SUMMARY 

5 An embodiment of the present invention provides a method and apparatus for 

measuring overlay. For a typical implementation, one or more grating stacks are included in 
a semiconductor wafer. Each grating stack includes an upper grating and a lower grating, 
each formed on one of the two layers for which overlay is to be measured. Each grating 
typically has a square or rectangular shape in the X-Y plane defined by the surface of the 

10 semiconductor wafer. At least one of the gratings stacks is configured so that its upper 
grating is larger or smaller (in the X or Y direction) than its lower grating. 

For the method of the present invention, gross overlay is first measured by analyzing 
one or more grating stacks. Each gross overlay measurement is obtained by measuring, using 
optical microscopy, a grating stack that includes upper and lower gratings of different size. 

1 5 A fine (i.e., non-gross) overlay measurement is then obtained by scatterometry. Typically, 
this means that a probe beam is directed at one or more of the grating stacks and the resulting 
diffraction is analyzed to determine the fine overlay between the upper and lower layers 
being analyzed. 

The gross and fine overlay measxirements are then combined in a way that is 
20 consistent with each measurement. Typically, this means that the gross overlay measurement 
is normalized to locate a starting point for the fine overlay measurement. The fine overlay 
measurement is then added or subtracted to that starting point to define a total overlay 
measurement. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 shows in cross-section the stack of two gratings that are used in the 

diffiraction-based overlay metrology in the prior art. 

Figure 2 shows calculated near-normal specular reflectance of the grating stack in 
Figure 1 as a fimction of offset ^ for four different wavelengths. 

Figure 3 shows the overlay measured using grating stacks as shown in Figure 2 
30 according to the prior art. 
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Figure 4 is a graph that simpHfies and illustrates the periodic nature of the graph in 
Figure 3. 

Figure 5 shows overlay metrology target according to an embodiment of the present 
invention. 

5 Figure 6 shows another embodiment for the overlay metrology target configured, in 

this case to measure overlay in two directions using a single grating stack. 

Figure 7 shows another embodiment for the overlay metrology target configured, in 
this case using 2-dimensional arrays of three-dimensional features such as contact holes, 
posts, islands. 

10 Figure 8 shows another embodiment for the overlay metrology target configured, in 

this case to measure overlay in two directions using a single grating stack with each layer in 
the grating stack being the same size. 

Figure 9 shows a schematic diagram of a measurement instrument. 
Figure 10 shows the measurement uncertainties for three embodiments of the present 
1 5 invention as a function of actual overlay. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring to Figure 5, overlay target 1 10 contains at least one grating stack 11 Ox that 
measures the x-component of the overlay and at least one grating stack 1 lOy that measures 
the y-component of the overlay. The grating stack 1 lOx and 1 lOy are, for example, placed in 

20 the scribe lines, or alleys, between dies on a semiconductor wafer. Grating stack 1 lOx and 
1 lOy are measured preferably one at a time (sequentially) by an optical instrument. The 
measurement spot of the optical instrument is 140x when it measures grating stack 1 lOx. 
The measurement spot then moves to 140y to measure grating stack 1 lOy. Alternatively, the 
measurement instrument has two measurement spots 140x and 140y that perform spatially 

25 separated but simultaneous measurements. Grating stack 1 1 Ox has a lower grating 120x and 
an upper grating 130x. Gratings 120x and 130x differ at least in one dimension. In Figure 5, 
the upper grating 130x is shown to be shorter than grating 120x in the direction of the grating 
lines. The dimensions of the gratings are such that, both gratings 120x and 130x completely 
cover the measurement spot 140x. The same apply to grating stack 11 Oy, lower 

30 grating 120y, upper grating 130y, and measurement spot 140y. The distances xi, X2, yi, and 
y2 shown in Figure 5 are the widths of the strips where only one of the gratings is visible 
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looking down on the wafer. The x-component of the gross overlay is obtained as 
AX=(xi-X2)/2. Process induced errors that affect the distances xi and X2 in the same manner 
are canceled by the differencing operation. In the alternative embodiment shown in Figure 6, 
the lower and upper gratings differ in both dimensions. The distances xi, X2, yi, and y2 are 
5 measured on the image of one or more grating stacks. 

Figure 7 shows an altemative embodiment where the lower and upper gratings are 2- 
dimensional arrays of features such as contact holes, posts, islands or other 3-dimensional 
features. The distances xi, X2, yi, and y2 are measured on the image of one or more grating 
stacks. Figure 8 shows an altemative embodiment where the upper 130x and lower 120x 

10 gratings have the same dimensions. The lower and upper gratings are preferably printed with 
an offset bias that is large enough to be measured by the imaging system. The overlay is 
obtained as AX=(xi-i-X2)/2 - (offset bias). This embodiment is not preferred because gross 
overlay is not obtained from the difference of two similar distances. 

A vision system 200, such as the one shown in Figure 9, is used to navigate and find 

15 the metrology targets on the wafer 1 80 and also to measure the distances xi, X2, yi, and y2. 
The vision system consists of a light source 220, condenser or illumination optics 230, beam 
splitter 240, objective 250, tube-lens 260, camera 270 with a CCD or CMOS array detector, 
image processor 280. 

In the preferred implementation, the vision system 220 is separate from the system 

20 that measures the optical properties of the grating as a function of wavelength or angle of 
incidence or both. For example. Figure 9 shows an oblique incidence ellipsometer 300 that 
measures the optical properties of the gratings. The components of ellipsometer 300 are 
broadband light source 310, collimator 320, rotary polarizer 330, optional rotary 
waveplate 340, refractive or reflective illumination objective 350, collection objective 360, 

25 optional rotary waveplate 370, rotary polarizer 380, re-imaging optics 390, pinhole 392, 

grating spectrometer 394. Optical components 230, 250, 260, 320, 350, 360, and 390 can be 
compound refractive or reflective lenses. A normal or oblique incidence, polarized or 
unpolarized reflectometer can also be used to measure the optical properties of the grating 
stacks. The spectroscopic metrology system can also share its optical path with the vision 

30 system (not shown in Figure 9). 
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The vision system measures the distances xi, X2, yi, y2j and some known and well- 
controlled distance Gx, and Gy to calibrate the pixels in micrometers. In the simplest 
example, Gx, and Gy can be the x and y dimensions of a grating as shown in Figure 5. This 
embodiment is not preferred because line end shortening would affect Gy. In a more 
5 preferred embodiment, the calibration distances Gx, and Gy are distances between identical 
features. The following differences are formed from the measurements of the vision system: 
AXvisiON = (xi-X2)/2 (5) 
AYvisiON = (yry2)/2 (6) 
This is similar to the box-in-box measurements in imaging-based prior art except the 
10 boxes are not stacks of thin films but stacks of gratings. The overlay measurement 

(AXvisioN > AYvision)? which is a measurement of gross overlay, is less precise than grating 
measurements but it has a significantly larger measurement range. The range of the overlay 
measurement by the vision system is equal to the distance from the grating stack to the edge 
of the field of view of the camera or the distance from the grating stack to the nearest 
15 structure on the wafer, whichever distance is smaller. Typically, this range is at least 2 
micrometers. 

Embodiment-l for combining image and diffraction-based overlay measurements 

In one embodiment of the invention, overlay is measured as: 

P 
4 



Ar 



^GRATING \^V1SI0N \ ^ ' 



^VISION \^V7SIOn\ — 



(7) 

P ^ ^ 



4 

20 The total measurement uncertainty of the vision-based overlay measurement, 

TMUyj^jQf^ , is preferably no greater than P/8. The y-component of overlay is calculated in an 
analogous fashion. In this embodiment, the measurement uncertainty is determined by the 
diffraction-based measurement when | Ax^ctt/^l | < P / 4 - TMUyj^joN • Th^ measurement 
uncertainty is determined by the vision system when {^actual | > ^ / 4 + TMUyj^jQj^ . 

25 Curve 400 in Figure 10 shows a Monte-Carlo simulation of the measurement uncertainty for 
this embodiment. In this simulation, the measurement uncertainties of the vision and grating 
systems are ±P/8 and ±P/1000 respectively. Random measurement errors were uniformly 
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distributed in a ±P/8 neighborhood for the vision-based measurement and a ±P/1000 
neighborhood for the diffraction-based measurement. 

Embodiment-2 for combining image and diffraction-based overlay measurements 

In another embodiment of the invention, the vision and diffraction-based overlay 
measurements are combined as follows: 

(8) 



M = int 



PI2 

^KfEAs Ax^™ +MP/2 

M is an estimate of the integer N in Equation (4). The estimate is obtained from the 
vision-based overlay measurement. Given that overlay is in half-period, a high-precision 
^MEAs obtained from the diffraction-based measurement as described by the second 

10 equation in (8). The y-component of overlay is obtained in an analogous fashion. In this 
embodiment, the precision of overlay is determined by the diffraction-based measurement 
except at the dead-points of the diffraction-based measurement where the diffraction based- 
measurement is not sensitive to overlay. For a single grating stack with an offset bias of P/4, 
or for two grating stacks with offset biases of +P/4 and -P/4, sensitivity to overlay 

1 5 diminishes when overlay is an odd multiple of P/4 as seen in Figures 3 and 4. In this case, 
^mAs according to Equation (8) has the precision of the diffraction based measurement for 
all overlay values except the ones close to odd multiples of P/4. When overlay is in a 
TMUyjsjQj^ wide neighborhood of an odd multiple of P/4, the precision of ^j^^s ^q^al to 
that of AXyjgjQj^ . TMUyjsjoN is required to be no larger than P/8 in this embodiment. The 

20 measurement uncertainty for this embodiment is shown by curve 410 in Figure 10. 

Embodiment-3 for combining image and diffraction-based overlay measurements 

In this embodiment, at least two grating stacks are used to measure overlay in one 
direction. The two gratings differ in offset bias by P/4. They yield the following diffraction- 
based overlay measurements, not accoimting for measurement errors: 
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iV, = int 



Ax, 



ACTUAL 



Ax, 



GRATING-1 



PI2 J 

= (-lf'[Ax^CTUAL-^.^/2] 



=int 



Ax, 



ACTUAL 



+ (P/4) 



PI2 

^GRATING-2 ~ ("0 ^ [^ACTUAL 



■{PIA)-N^PI2\ 



The overlay measurements are combined as follows: 

AxTv 



M, =int 



M2 =int 



"•VISION 



PI2 

^VISION +(^/ 4) 



PI2 



Ax, 



Ax, 



(-1^' +M,(i'/2) if lAx, 



GRATING-I 



(8) 



(9) 



Ax, 



(-l)^^+M2(P/2)-(P/4) iflAx, 



GRATING-2 



GRATING-1 1 '^P 1^ 

I > P/8 
GRATING-1 — ' O 



Provided that TMUyj^j^^ is no greater than P/8, the precision of Ax^^^ is equal to the 

precision of the diffraction-based overlay even for gross overlay. The measurement 
uncertainty for this embodiment is shown by curve 420 in Figure 10. 
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